Estimation of suspended sediment concentration and yield using linear models, random forests and quantile regression forests

نویسندگان

  • T. Francke
  • B. Schröder
چکیده

For sediment yield estimation, intermittent measurements of suspended sediment concentration (SSC) have to be interpolated to derive a continuous sedigraph. Traditionally, sediment rating curves (SRCs) based on univariate linear regression of discharge and SSC (or the logarithms thereof) are used but alternative approaches (e.g. fuzzy logic, artificial neural networks, etc.) exist. This paper presents a comparison of the applicability of traditional SRCs, generalized linear models (GLMs) and nonparametric regression using Random Forests (RF) and Quantile Regression Forests (QRF) applied to a dataset of SSC obtained for four subcatchments (0Ð08, 41, 145 and 445 km2) in the Central Spanish Pyrenees. The observed SSCs are highly variable and range over six orders of magnitude. For these data, traditional SRCs performed inadequately due to the over-simplification of relating SSC solely to discharge. Instead, the multitude of acting processes required more flexibility to model these nonlinear relationships. Thus, alternative advanced machine learning techniques that have been successfully applied in other disciplines were tested. GLMs provide the option of including other relevant process variables (e.g. rainfall intensities and temporal information) but require the selection of the most appropriate predictors. For the given datasets, the investigated variable selection methods produced inconsistent results. All proposed GLMs showed an inferior performance, whereas RF and QRF proved to be very robust and performed favourably for reproducing sediment dynamics. QRF additionally provides estimates on the accuracy of the predictions and thus allows the assessment of uncertainties in the estimated sediment yield that is not commonly found in other methods. The capabilities of RF and QRF concerning the interpretation of predictor effects are also outlined. Copyright  2008 John Wiley & Sons, Ltd.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigation of forest road surface sediment estimation using two experimental models of SEDMODL and WARSEM

Extended abstract 1- Introduction A bare surface on forest roads is created due to road construction. This surface is the main source of erosion and sediment yield to streams in forest areas. The increase of sediment in streams causes dramatic damage to the quality of water ecosystems and the life of aquatic organisms. Therefore, road engineers should pay attention not only to the cost of roa...

متن کامل

Determination of the Best Model to Estimate Suspended Sediment Load in Zaremrood River, Mazandaran Province

Extended abstract 1- Introduction The phenomena of erosion, sediment transport, and sedimentations have tremendously destructive effects on the environment and hydraulics structures. In general, the sediment transportation depends on river discharges, but the proposed equations inherited serious errors.  The estimation of suspended sediment load (SSL) is one of the most important factors in r...

متن کامل

Quantile Regression Forests

Abstract Random Forests were introduced as a Machine Learning tool in Breiman (2001) and have since proven to be very popular and powerful for high-dimensional regression and classification. For regression, Random Forests give an accurate approximation of the conditional mean of a response variable. It is shown here that Random Forests provide information about the full conditional distribution...

متن کامل

Estimating river suspended sediment yield using MLP neural network in arid and semi-arid basins Case study: Bar River, Neyshaboor, Iran

Abstract Erosion and sedimentation are the most complicated problems in hydrodynamic which are very important in water-related projects of arid and semi-arid basins. For this reason, the presence of suitable methods for good estimation of suspended sediment load of rivers is very valuable. Solving hydrodynamic equations related to these phenomenons and access to a mathematical-conceptual mode...

متن کامل

Extensions to Quantile Regression Forests for Very High-Dimensional Data

This paper describes new extensions to the state-of-the-art regression random forests Quantile Regression Forests (QRF) for applications to high dimensional data with thousands of features. We propose a new subspace sampling method that randomly samples a subset of features from two separate feature sets, one containing important features and the other one containing less important features. Th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008